Learning to Filter User Explicit Intents in Online Vietnamese Social Media Texts

نویسندگان

  • Thai-Le Luong
  • Thi-Hanh Tran
  • Quoc-Tuan Truong
  • Thi-Minh-Ngoc Truong
  • Thi-Thu Phi
  • Xuan-Hieu Phan
چکیده

Today, Internet users are much more willing to express themselves on online social media channels. They commonly share their daily activities, their thoughts or feelings, and even their intention (e.g., buy a camera, rent an apartment, borrow a loan, etc.) about what they plan to do on blogs, forums, and especially online social networks. Understanding intents of online users, therefore, has become a crucial need for many enterprises operating in different business areas like production, banking, retail, e–commerce, and online advertising. In this paper, we will present a machine learning approach to analyze users’ posts and comments on online social media to filter posts or comments containing user plans or intents. Fully understanding user intent in social media texts is a complicated process including three major stages: user intent filtering, intent domain identification, and intent parsing and extraction. In the scope of this study, we will propose a solution to the first one, that is, building a binary classification model to determine whether a post or comment carries an intent or not. We carefully conducted an empirical evaluation for our model on a medium–sized collection of posts in Vietnamese and achieved promising results with an average accuracy of more than 90%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Similarity measurement for describe user images in social media

Online social networks like Instagram are places for communication. Also, these media produce rich metadata which are useful for further analysis in many fields including health and cognitive science. Many researchers are using these metadata like hashtags, images, etc. to detect patterns of user activities. However, there are several serious ambiguities like how much reliable are these informa...

متن کامل

Identifying User Intents in Vietnamese Spoken Language Commands and Its Application in Smart Mobile Voice Interaction

This paper presents a lightweight machine learning model and a fast conjunction matching method to the problem of identifying user intents behind their spoken text commands. These model and method were integrated into a mobile virtual assistant for Vietnamese (VAV) to understand what mobile users mean to carry out on their smartphones via their commands. User intent, in the scope of our work, i...

متن کامل

A Grouping Hotel Recommender System Based on Deep Learning and Sentiment Analysis

Recommender systems are important tools for users to identify their preferred items and for businesses to improve their products and services. In recent years, the use of online services for selection and reservation of hotels have witnessed a booming growth. Customer’ reviews have replaced the word of mouth marketing, but searching hotels based on user priorities is more time-consuming. This s...

متن کامل

UTCNN: a Deep Learning Model of Stance Classification on Social Media Text

Most neural network models for document classification on social media focus on text information to the neglect of other information on these platforms. In this paper, we classify post stance on social media channels and develop UTCNN, a neural network model that incorporates user tastes, topic tastes, and user comments on posts. UTCNN not only works on social media texts, but also analyzes tex...

متن کامل

Inferring Latent User Properties from Texts Published in Social Media

We demonstrate an approach to predict latent personal attributes including user demographics, online personality, emotions and sentiments from texts published on Twitter. We rely on machine learning and natural language processing techniques to learn models from user communications. We first examine individual tweets to detect emotions and opinions emanating from them, and then analyze all the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016